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Learning from nature's amazing molecular machines, globular proteins, we present a framework for 
the predictive design of nano-machines. We show that the crucial ingredients for a chain molecule to 
behave as a machine are its inherent anisotropy and the coupling between the local Frenet coordinate 
reference frames of nearby monomers. We demonstrate that, even in the absence of heterogeneity, 
protein-like behavior is obtained for a simple chain molecule made up of just thirty hard spheres. 
This chain spontaneously switches between two distinct geometries, a single helix and a dual helix, 
merely due to thermal fluctuations. 



Significant advances in laboratory techniques for tai- 
loring and processing materials at the atomic scale have 
resulted in nanotechnology becoming an increasingly ma- 
ture field with great promise. One of the exciting goals 
of the field is the design of powerful machines, such as 
functional entitie s that can switch reversibly between dis- 
tinct geometries f^l^. Such machines would not only 
be of great use on their own but also could yield novel 
emergent behavior on networking them togethei^. The 
existence of life in its myriad forms provides a proof- 
of-concept of what one might aspire to accomplish with 
nanotechnology. 

Proteins are complex water-soluble chain molecules 
made up of tens or hundreds of twenty types of natu- 
rally occurring amino acids and exhibit conformational 
switching triggered by influences such as ligand bind- 
ing. At the nanoscale, thermal fluctuations yield forces 
with magnitude comparable to those involved in chemi- 
cal reactions catalyzed by the protein^. How might one 
design a machine whose functionality is structure-based? 
In its simplest form, we seek an object which takes on 
just a few distinct geometries in a reproducible manner 
and is able to switch reversibly between them due to ther- 
mal fluctuations. Such a situation would allow external 
stabilizing influences to favor a given conformation over 
the others and allow for the development of powerful ma- 
chines at the nanoscale. 

A collection of hard spheres constitutes the simplest 
model of matter and exhibits both a crystalline phase 
and a fluid phase on varying the density of sphered. A 
linear chain of hard spheres is the simplest connected 
object with the fewest constraints and thus the great- 
est flexibilit}'^ . Such a chain, at high temperatures or 
when there are no interactions promoting compaction, 
would occupy a random coil phase in which all self- 
avoiding conformations are equally likely. This situa- 
tion is not conducive for machine design. In the pres- 
ence of inter-sphere interactions promoting compaction 
and when there are no frustrating influences from the 
sphere tethering, one would expect a generic compact 
phase in which, at least locally, the preferred confor- 
mation is that of a face-centered cubic (fee) lattice. It 



was conjectured by Kepler, and proven more recently by 
Hales, that the fee structure provides for optimal packing 
of unconstrained spheres^^. There is again a high degen- 
eracy of compact conformations with no assurance that 
any given conformation will be reached from a random 
coil conformation rapidly and reproducibly. Again, this 
situation is not conducive for the design of machines. 

Of course, what one requires is a phase of matter with 
much fewer ground state conformations than either the 
random coil or generic compact phase. Additionally, one 
needs a phase which is in the vicinity of a phase tran- 
sition (or crossover for finite size systems) to a distinct 
phase, because that provides for exquisite sensitivity to 
the right types of external influences. Our thinking is 
guided by the liquid crystal phase^ which is a distinct 
state of matter that is poised in the vicinity of the liquid 
phase. This phase is known to be one of the most sen- 
sitive phases of matter. The liquid crystal phase opens 
up because of the anisotropy of the constituent molecules 
- there is no longer a need for simultaneous ordering in 
all three directions. Rather, one can have translational 
order in fewer than three dimensions and orientational 
order. Here we address the issue of how one might open 
up a distinct phase with relatively few ground state con- 
formations in the vicinity of the random coil phase. Such 
a phase would then be the analog of a liquid crystal phase 
but this time for chain molecules. 

Armed with insights from the liquid crystal phase and 
the behavior of proteins, we identify two mechanisms for 
thinning the number of candidate ground state structures 
and demonstrate that these are sufficient for the creation 
of the sought after behavior. One can define a Frenet 
reference frame (a local Cartesian coordinate system) at 
each location of the chain molecule comprised of the tan- 
gent, the normal, and the binormal as the orthogonal 
coordinate axes. These coordinate systems play a crucial 
role in at least two ways, as can be readily seen in the pro- 
tein context: first, the chemistry of hydrogen bonds and 
other chemical features yield constraints on the relative 
orientations between the coordinate systems associated 
with pairs of amino acids in contact; second, the side- 
chains of amino acids are located in a specified direction 





FIG. 1: Sketch of the chain molecule. The backbone is mod- 
eled as a chain of (blue) spheres of radius a with a separation 
along the chain / = 3.8 A. The nearest neighbor spheres along 
the backbone are allowed to overlap with each other thereby 
overtly introducing uniaxial anisotropy. ti and are the tan- 
gent and normal vectors assigned to each sphere, except to 
those at the ends of the chain. Side-spheres (shown in pink) of 
radius b are attached to the backbone spheres in the negative 
normal direction. The side-spheres are not allowed to overlap 
with either the backbone spheres or with each other. 



with respect to these coordinate systems - for example, 
the amino acid side chains are typically pointed approx- 
imately in the outward normal direction. Indeed, both 
the a-helix and the /3-sheet allow for the placement of 
side-chains in a manner that avoids steric clashes. 

The first mechanism follows from the observation that 
a model of a chain comprised of spheres is unable to cap- 
ture the inherent anisotropy induced by the presence of 
the chain constraint - at each location of the chain, there 
is a tangent direction defined by the two adjacent objects. 
Thus the simplest model capturing the correct symmetry 
is one in which the constituent monomers are no longer 
isotropic. We therefore allow for the overlap of van 
der Waals spheres of adjacent monomers along the chain. 
Such an overlap overtly breaks the isotropy of the spheres 
and confers uniaxial anisotropy to the chain. In both the 
emergent building blocks of protein structures, a-helices 
and /3-sheets, nearby segments are placed right up against 
each other and aligned parallel to each other, reflect- 
ing the anisotropy. In a protein, the distance between 
neighboring C*^ atoms is 3. 8 A requiring that the radius 
of the backbone of an amino acid be larger than 1.9 A. 
This constraint is easily met by considering the van der 
Waals radii of the constituent atoms. For example, one 
would estimat^^ that the smallest amino acid, glycine. 
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FIG. 2: Phase diagram of chain molecules made up of spheres. 
The panels (a, b) show the phase diagrams at T = in the 
a — Rc plane for chains of length N=12 without side-spheres 
{b — 0) and with side-spheres {b = 2.5A) respectively. The 
intermediate compact phase arises on the edge of compaction 
of the chain molecule when Rc becomes sufficiently large to 
allow the attractive interaction to be effective. The vicin- 
ity of this phase to other phases (it is sandwiched between 
two distinct phases) confers sensitivity to structures in the 
intermediate compact phase. The other panels show typical 
ground state conformations: a random coil conformation ob- 
tained with Rc < 2a (c), a globule conformation obtained 
with a = 2. 5 A, Rc = 9 A (d), a saddle conformation obtained 
with a — 3.2A, Rc = 7 A (e), several helix conformations 
with different pitch to radius ratios obtained for a = 2.6A, 
R^ = 6.4A (f), a = 3.2A, Rc = 8A (g), and a = 3.4A, 
Rc = 9 A (h). The helical and saddle ground state confor- 
mations in the model with no side-spheres are retained even 
when side-spheres are present {b = 2.5A). However, the side- 
spheres eliminate several random globule and random coil 
conformations thereby stabilizing the intermediate compact 
phase. 



has a radius larger than 2.4A. Second, we study the effect 
of attaching side-chains to the backbone monomer, e.g., 
in the negative normal direction. The self- avoidance of 
these side spheres with each other and with the backbone 
spheres results in an induced coupling between pairs of 
Frenet frames. 

We find that each of these features results in the cre- 
ation of a distinct intermediate phase for short chains 
sandwiched between the generic compact phase and the 
random coil phase. Furthermore, when the two fea- 
tures are combined, the phase is stabilized and occupies 
a larger region in parameter space. For our simplified 
model of a chain of backbone spheres (Fig. 1) with a 
sphere separation along the sequence of 3.8A, we have 
three length parameters: the radius, a, of the backbone 
spheres, the radius, 6, of the side-spheres, and the cut-off 
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FIG. 3: Multiple specific heat peaks corresponding to three 
distinct phases. We study a chain of = 16 with a = SA, 
b = 2.5A, and Rc = 7.5A. (a) The temperature dependence of 
the specific heat indicates two peaks - one at T ^ 0.2e/kB and 
another ai T ^ 0.97s /ks- The lower temperature peak indi- 
cates a crossover into a single helix conformation from a dual 
helix, whereas the peak at the higher temperature indicates a 
crossover between the dual helix and the random coil phases, 
(b) Contour plot of the effective free energy as a function of 
energy and contact order, CO, at the lower transition tem- 
perature. CO, is defined as the sum over sequence separation 
of all contacts divided by the product of the number of con- 
tacts and N. The effective free energy at a given temperature 
T is determined as T{E,CO) = -/csTlnP(^, CO), where 
P{E, CO) is a weighted two-dimensional histogram for that 
temperature obtained using the multiple histogram method^. 
The unweighted histograms at multiple temperatures are col- 
lected through parallel temperin^^ Monte Carlo simulations. 
The free energy difference between consecutive contour levels 
is 4:6. The contour plot shows two minima corresponding to 
the single helix and the dual helix as indicated. 



scale, Rc^ for the pairwise attractive interaction poten- 
tial, e/kB^ between the backbone spheres (See Materials 
and Methods). In the absence of side-spheres and for 
non-overlapping backbone spheres, a < 1.9A, there is a 
crossover from a random coil phase (in which the con- 
formations are mostly extended) to a globule phase in 
which one has compact conformations with no distinct 
motifs such as helices. The situation becomes qualita- 
tively different on incorporating one or both of the two 
key features. 

Fig. 2a shows the T = phase diagrams in the a-Rc 



plane for a = 12 chain. Distinct phases, corresponding 
to helical and saddle-like conformations, emerge between 
the coil and the globular phases analogous to the opening 
up of a liquid crystal phase between the liquid and crys- 
talline phases for anisotropic molecules^. The saddle can 
be thought of as a piece of a dual helix structure and does 
develop into a dual helix for longer chains. (The dual he- 
lix is distinct from the double helix because the former 
is a conformation of a single chain whereas the latter is 
comprised of two chains.) These phases have a lower en- 
tropy than both the globule and random coil (swollen) 
phases but are stabilized by the attractive interaction 
potential. We also find that the presence of side-spheres 
even without overlap of the backbone spheres induces 
helical conformations (Figure SI in SI). This effect is ac- 
centuated when both features are present simultaneously 
(Figure S2 in SI). Interestingly, conformations composed 
of nearly parallel strands similar to those in /^-sheets can 
be obtained by adding a bending energy into the model 
(Figure S3 in SI). 

We now consider an even shorter system {N = 16) with 
an enhanced overlap of the backbone spheres: a = 3.0 A. 
The radius of the side-sphere is chosen to be 6 = 2.5A, 
and the range of attraction is now increased to Rc = 
7.5A. These parameter values yield a helix ground state 
but is close to a crossover to the dual helix state. Fig. 3a 
shows that the specific heat has two peaks correspond- 
ing to two crossover temperatures, Ti ^ 0.2e/kB and 
T2 ~ 0.97 e/ks' At the lowest temperatures, the helix 
is the dominant conformation. At intermediate temper- 
atures, one obtains the dual helix structure, whereas, at 
higher temperatures, one obtains a random coil. Fig. 3b 
shows the contour plots of the free energy on the energy- 
contact order plane at Ti and exhibits pronounced min- 
ima corresponding to the single helix and dual helix. The 
helix is a bit lower in energy while the dual helix is en- 
tropically more favorable. The free energy barrier for 
the switching at Ti is about Ae which is equivalent to 
the energy required to break 4 contacts. This barrier is 
significantly higher than ksTi and the two helical confor- 
mations are each quite stable at this temperature. Quite 
remarkably, our Monte Carlo simulations with standard 
pivot and crankshaft move sets show a dynamical switch- 
ing between the two conformations in a single trajectory 
at a somewhat higher temperature oiT = O.be/kB- The 
system is bistable with a rapid switching between dis- 
tinct conformations (for examples of molecular switches, 
see Reis^^ with little weight for intermediate confor- 
mations (Fig. 4). Note that the switching can also be 
easily effected by means of an external influence which is 
sensitive to the chain end-to-end distance. 

It is important to note that the ease with which one 
obtains this system without any fine-tuning of details is 
made possible by the existence of the intermediate phase. 
The conformations in this phase are in the vicinity of 
the random coil phase while retaining order due to the 
proximity of the compact phase. This results in special 
sensitivity to small perturbations induced, e.g., by ther- 
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mal fluctuations which, in the example presented here, 
are responsible for the switching between two distinct 
geometrical shapes. Unlike the random coil conforma- 
tions which can switch from one to another easily due 
to thermal fluctuations, the structures in the interme- 
diate phase exhibit some stability. At the same time, 
the structures are not so densely compact that they are 
subject to sluggish dynamics and kinetic inaccessibility 
characteristic of the glassy phase. These distinct advan- 
tages of the intermediate phase have a wider applicabil- 
ity than for chain molecules as evidenced by the sensitive 
liquid crystal phase sandwiched between the crystalline 
and liquid phases. In liquid crystals, the anisotropy arises 
from the asymmetric shape of the constituent molecules, 
whereas here the anisotropy is a natural consequence of 
the chain topology of the m olecule. Interestingly, earlier 
computational studieJff^^ had found that overlapping 
adjacent monomers yield helical conformations. Also 
protein- l ike folds are ado pted by a host of non-biological 
^^^^^^^^Mmnmimm (see Appendix). 

Understanding the properties of matter is greatly sim- 
plifled on using the concept of its phases^. For example, 
a liquid possesses certain gross properties, such as adopt- 
ing the shape of the container and its ability to flow, irre- 
spective of its constituent molecules and their underlying 
chemistry. Globular proteins share a great deal of com- 
mon characteristics - they are all linear chains of the same 
20 amino acids, they fold rapidly and reproducibly into 
their native state structures, these structures are made 
up of emergent building blocks in the form of helices and 
almost planar sheets, the total numbe r of d istinct folds 
that proteins adopt is limited in numbei'^^^ much as the 
number of space groups associated with Bravais latticed 
is 230, proteins are flexible and versatile in their folded 
state, and proteins have a tendency to aggregate and 
form amyloid which in turn is implicated in debilitating 
human diseases. These common attributes of proteins 
suggest that pr otein structures occupy an intermediate 
phase of mattei'I^'^ which confers on them their many 
amazing characteristics. 

Whitesides wrote^ that one ought to "take existing 
nanomachines-those present in the cell-and learn from 
them. We will undoubtedly be able to extract from these 
systems concepts and principles that will enable us to 
make variants of them that will serve our purposes, and 
others that will have entirely new functions" . The phase 
of matter so successfully used by nature as the basis of 
life is ready to be exploited in the laboratory. 



Methods 

We consider chains of N hard spheres each of radius 
a. The beads spacing along the chain is flxed to be 
/ = 3.8A. Non-consecutive spheres are not allowed to 
overlap, whereas the neighboring spheres along the chain 
can overlap when a is larger than 1.9 A. Non-consecutive 
spheres interact via a pairwise square- well potential equal 
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FIG. 4: Dynamical switching between the single helix and the 
dual helix, (a) A long Monte Carlo trajectory, with standard 
pivot and crankshaft move setJ^, at T = 0.5e:/A:s, shows fre- 
quent switches between the single helix (lower contact order) 
and the dual helix (higher contact order). The chain studied 
here is the same as in Fig. 3. Both the single and the dual 
helix have similar energies, (b) Similar to panel a but for a 
much smaller window of time steps and with higher resolution 
showing a single switch from the single helix to the dual he- 
lix. The bottom figures are the snapshots during the switch 
at times t (in units of 10^ steps) as indicated. 



to —e within a contact range, Rc. A Frenet frame of ref- 
erence is assigned to each sphere, i, except to those at the 
ends of the chain. The tangent vector, t^, in this frame 
is a unit vector tangential to the circle passing through 
the centers of beads i — l^i and The Frenet normal 

vector, n^, is an unit vector pointing to the center of this 
circle. The third vector of the frame is the binormal vec- 
tor, denoted by b^, and is defined to be the cross product 
of the tangent and the normal vectors. A side-sphere of 
radius b is attached to each backbone sphere in direc- 
tion opposite to the normal with the distance between 
the centers of the backbone sphere and its side-sphere 
partner equal to a + 6. The backbone spheres at the two 
ends of the chain do not have any side-spheres attached 
to them. The role of the side-spheres is entirely steric 
- they are not allowed to overlap with any of the other 
spheres in the system. The energy of the chain in given 
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conformation can be written as 

N-2 N 

E = -eJ2 E mc-\n-rj\), (1) 

where fi are center positions of the backbone spheres, TV 
is the number of such spheres, and the step function Q{x) 
is equal to 1 if x > and otherwise. 

We employ a parallel temperin^^ Monte Carlo (MC) 
scheme for obtaining the ground state as well as other 
equilibrium characteristics of the system. The simulation 
entails monitoring 20-30 replicas, each evolving at its own 
selected temperature, T^. For each replica, the simula- 
tion is carried out with the standard pivot and crankshaft 
move sets^^ and the Metropolis algorithm for move accep- 
tance. In a pivot move, one randomly chooses a sphere i 
in the chain and rotates the shorter part of the chain (ei- 
ther from 1 through z — 1 or from through where 
N is the number of spheres in the chain) by a small angle 
and about a randomly chosen axis that goes through the 
i-th sphere. In a crankshaft move, two spheres i and j 
are chosen randomly such that — < 6 and the spheres 
between i and j are rotated by a small angle and about 
the axis that goes through i and j. In both move sets, 
the rotation angle is randomly drawn from a Gaussian 
distribution of zero mean and a dispersion of 4^. An at- 
tempt to exchange replicas i and j is made every 100 
MC steps. The exchange is accepted with a probability 
equal to pij = max {l, exp[A:^^(7;"^ - T~^){Ei - Ej)]}, 
where /c^ is the Boltzmann constant, and Ei and Ej are 
the energies of the replicas at the time of the exchange. 
The weighted multiple histogram technique^^ is used to 
compute the specific heat and the effective free energy. 



Poly(ethylene glycol) (PEG) has been shown to as- 
sume a helical conformation in isobutric acid with 
a trace amount of wate r without which PEG forms 
a coil configuratior^^^. PEG also forms helices in 
isopentanoic and n-propanoic acids but not in isobu- 
tanol or n-butanoP^. In isobutyric acid, PEG forms 
a mixture of helices and coils whereas a similar poly- 
mer poly(ethy-lene imine) (PEI) merely forms helices. 
Phenylacetylene oligomers of specific chain lengths 
(up to 18 units) and containing a tri(ethylene oxide) 
side-chain segment at each repeat unit have been found 
to undergo sharp switching between arrays of random 
coils and arrays of helical conformations on changing 
the solvent compositioiP^. Another interesting example 
is provided by poly(ethylene oxide) (PEO) dissolved in 
an electrolyte consisting of a lithium salt^^, LiCFsSOs. 
In the crystalline phase of the system, the PEO chains 
are helical and form parallel arrays. In the amorphous 
phase, the arrays dissolve into separate helices. Finally, 
there are several examples of synthesis of helical poly- 
mers that are discussed by Sanda et al.^ and which 
include polychloral, polyisocyanates, polyisocyanides, 
polisilanes, and polycetylenes. Such helical structures 
can be made to have adjustable geometry and then 
used to generate nanocavities of tunable sizes^^. Sanda 
et al.^^ have recognized that the crucial factor for the 
synthetic polymers to adopt the helical structures is the 
steric repulsion between the side chains combined with 
attraction that is usually provided by hydrogen bonding. 
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Figure SI: Correlation functions for a chain of non- 
overlapping spheres. The tangent- tangent, normal- 
normal, and the binormal-binormal correlations as a 
function of sequence separation are shown for a chain 
of N = 50 non-overlapping spheres (a = 1.9 A) with 
Rc = 6.bA. The chain is studied at T = O-G^/Zc^ which is 
below the collapse transition. Panel a corresponds to the 
situation without the side-spheres (6 = 0) and panel b 
with side-spheres {b = 2.5A). The tangent-tangent corre- 
lation for a sequence separation s is defined as (t^ • t^+g) 
where the average (•) is taken over all i running from 2 
to {N — s — 1) and over all sample conformations. The 
normal-normal and binormal-binormal correlations are 
defined similarly. The correlations are averaged over a 
run of 10^ MC steps, in which conformations are con- 
sidered every 10^ steps. The conformations displayed at 
the bottom are typical snapshots obtained in the simula- 
tion at two different temperatures as indicated. The two 
conformations on the left are for 6 = and the remain- 
ing two are for b = 2. 5 A. Note that the presence of the 
side-spheres induce helical conformations and yield qual- 
itatively distinct behaviors of the correlation functions 
of the tangent, normal, and binormal vectors along the 
chain. 
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additional bending energy along the chain. The parame- 
ters are a = 2. 5 A, h = 2.5 A and Rc = 6. 5 A. The bending 
energy is given by ^^^2^ ei^(l— cos(^^)), where cr is a co- 
efficient and Oi is the angle between two consecutive con- 
nectivity vectors r^_i^^ = — r^_i and r^^^+i = r^+i — 
associated with the sphere i. The conformations shown 
are for cr = 2s (a), As (b) and 8s (c), respectively. Note 
that for the largest value of cr, one obtains a planar 
hairpin-like conformation. 
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Figure S2: The crossover between the single helix and 
the random coil phases in a chain of = 50 overlap- 
ping spheres. The parameters are a = 2. 5 A, b = 2. 5 A, 
and Rc = 6.5A. The ground state for this system is 
a single helix, (a) Dependence of the specific heat on 
temperature showing a peak at T^^ax ^ 0.72s /kB- (b) 
Dependence of the tangent-tangent correlation on the se- 
quence separation for three different temperatures: well- 
above the transition (T = 1.6s /ks)^ just above the 
transition (T = O.Ss/kB) and just below the transi- 
tion (T = 0.6s /ks)' (c) Snapshots of typical confor- 
mations at the three temperatures. The specific heat is 
obtained from fiuctuations in the energy using the equa- 
tion C = {{E'^) - {Ef)/{kBT'^), where E is the energy 
and (•) indicates an average. Note that the signature 
of the helical ground state is present in the correlation 
functions even at a temperature distinctly higher than 
Tmax showing that information about the ordered low 
temperature state is encoded even in the high tempera- 
ture phase. 
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Figure S3: Ground state conformations of a chain of 
N = 16 overlapping spheres with side-spheres and an 



